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ABSTRACT 

Autoimmune regulator (AIRE) is a transcription 
factor that induces the expression of a large subset 
of otherwise strictly tissue restricted antigens in 
medullary thymic epithelial cells, thereby enabling 
their presentation to developing T cells for negative 
selection. Mutations in AIRE lead to autoimmune- 
polyendocrinopathy-candidiasis-ectodermal dystro- 
phy (APECED), a rare monogenetic disease. 
Although it has been reported that AIRE interacts 
with proteins involved in nuclear transport, DNA- 
damage response, chromatin remodeling, transcrip- 
tion and pre-mRNA-splicing, the precise mechanism 
of AIRE-induced tissue restricted antigen expres- 
sion has remained elusive. In this study, we in- 
vestigated an APECED patient mutation that 
causes the loss of the extreme C-terminus of AIRE 
and found that this mutant protein is transcript- 
ionaly inactive. When tethered heterologously to 
DNA, this domain could stimulate transcription 
and splicing by itself. Moreover, the loss of this 
C-terminus disrupted interactions with the positive 
transcription elongation factor b (P-TEFb). Via P-TEFb, 
AIRE increased levels of RNA polymerase II on and 
enhanced pre-mRNA splicing of heterologous and 
endogenous target genes. Indeed, the inhibition of 
CDK9, the kinase subunit of P-TEFb, inhibited AIRE- 
induced pre-mRNA splicing of these genes. Thus, 
AIRE requires P-TEFb to activate transcription 
elongation and co-transcriptional processing of 
target genes. 

INTRODUCTION 

Tolerance to self is an essential element of the immune 
system and when tolerance is dysfunctional, autoimmune 



diseases can arise. Central tolerance is estabhshed by 
negative selection of developing T cells in the thymus 
(1). Most autoimmune diseases are dependent on many 
genetic loci, but a rare autoimmune disease, 
autoimmune-polyendocrinopathy-candidiasis-ectodermal 
dystrophy (APECED) is monogenetic. APECED patients 
have mutations in the AIRE gene (2,3), which encodes the 
autoimmune regulator (AIRE). AIRE is critical for the 
expression of a number of otherwise tissue restricted 
antigens (TRAs) in medullary thymic epithelial cells 
(mTECs) (4). Indeed, Aire~^~ mice develop multi-organ 
autoimmunity with inflammatory infiltrates and auto- 
reactive antibodies, which is caused by dysregulated nega- 
tive selection of developing T cells in the thymus (4), 
whereas the severity and organs affected are strain de- 
pendent (5-7). mTECs from these mice express a much 
smaller TRA repertoire than their wild-type (WT) 
counterparts. 

Previous studies suggested a role for AIRE in tran- 
scription (8). AIRE is localized to the nucleus where it 
accumulates in distinct nuclear structures (9), and when 
expressed transiently, it activates the expression of en- 
dogenous genes as well as exogenous plasmid targets 
(10). In addition, AIRE interacts with the CREB- 
binding protein (11) and the positive transcription elong- 
ation factor b (P-TEFb) (10). AIRE contains several 
domains that are present in various transcription factors. 
First, it has a SplOO, Aire-1, NucP41/75 and DEAF-1 
domain (SAND) that has been suggested to bind DNA 
(12). Second, AIRE contains two plant homeodomains 
(PHDl and PHD2). PHDl binds to histone-3 with 
non-methylated lysine at position 4 (H3K4me^) (13,14), 
which is abundant in transcriptional inactive chromatin. 
It was also proposed to be an E3-ubiquitin ligase (15). 
PHD2 might represent an accessory transcription activa- 
tion domain (TAD) (16,17). Furthermore, AIRE can also 
associate with proteins involved with nuclear transport, 
DNA-damage response, chromatin remodeling, transcrip- 
tion and pre-mRNA splicing (18). 
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Protein coding genes are transcribed by RNA polymer- 
ase II (RNAPII) and their expression is regulated at many 
different steps (19). The C-terminal domain (CTD) of 
human Rbpl, the largest RNAPII subunit, consists of 
52 Y1S2P3T4S5P6S7 hep tapep tide repeats. Serine residues 
in these repeats are phosphorylated by different CTD- 
kinases. Thus, the pattern of CTD phosphorylation 
changes dynamically as the RNAPII transitions through 
different phases of transcription (20). CTD phosphory- 
lated at serine- 5 (S5P) is a hallmark of transcription initi- 
ation, and is accompHshed by cyclin-dependent kinase 7 
(CDK7), a component of the general transcription factor 
TFIIH (21). On the other hand, CTD phosphorylated at 
serine-2 (S2P) is a hallmark of transcription elongation, 
and is catalyzed by P-TEFb (22), composed of the CDK9 
kinase and the regulatory cyclins Tl or T2a/b (23). 
Furthermore, pre-mRNA splicing occurs co-transcrip- 
tionaly, and the phosphorylated CTD is bridging these 
processes by binding components of pre-mRNA splicing 
and cleavage/polyadenylation machineries (24,25). Thus, 
pre-mRNA processing factors associate with the tran- 
scription elongation complex and the degree and type of 
CTD phosphorylation define the specificity and affinity of 
these interactions (26). 

In this study, we examined the functional relevance of 
aire's C-terminus by characterizing the molecular defect 
of an APECED patient mutation in AIRE. A previous 
report demonstrated that AIRE binds P-TEFb (10). We 
now mapped this interaction to the C-terminal 40 residues 
of AIRE and validated this finding with biochemical and 
functional assays. Indeed, our findings reveal that the 
C-terminal 40 residues of AIRE are critical for the recruit- 
ment of P-TEFb to target genes, leading to increased 
phosphorylation of the CTD, transcription elongation 
and pre-mRNA spHcing. 

MATERIALS AND METHODS 

Cell lines 

HeLa and 293T cells were obtained from ATCC and 
cultured in DMEM supplemented with 10% FBS and 
antibiotics. Mouse 1C6 mTECs were a kind gift from 
Dr M. Kasai [National Institute of Infectious Diseases, 
Tokyo, Japan (27)] and were cultured in S-MEM 
(Invitrogen) supplemented with 10% FBS (Invitrogen), 
5mM Glutamax (Invitrogen), 0.2 |iM (3-mercaptoethanol 
(Invitrogen), 0.5 |ig/ml hydrocortisone hemisuccinate 
(Sigma- Aldrich) and 1 mM sodium pyruvate (Invitrogen). 

Plasmids 

Flag- AIRE was a kind gift from Dr M. Matsumoto 
[University of Tokushima, Kuramoto, Tokushima, 
Japan (15)] and GAL. AIRE was a kind gift from 
Dr P. Peterson (University of Tartu, Tartu, Estonia). 
hINSLuc was kind gift from Dr M. German (UCSF, 
San Francisco, CA, USA). G5HIV2SVEDA was a kind 
gift from Dr R. KornbHht [Universidad de Buenos Aires, 
Buenos Aires, Argentina (28)] and G5HIV2dsx ± ESE 
were a kind gift from Dr J. Blencowe [University of 
Toronto, Toronto, Ontario, Canada (29)]. G5CAT has 



been previously described (30). HA-CDK9 and myc- 
cyclinTl have been previously described (31,32). 

The 505fsx520 mutation was generated in WT 
GAL.AIRE and WT Flag-AIRE by PCR mutagenesis 
using primers: (forward) AGGATGACACTCCAGTCA 
CGAGC and (reverse) GCTCGTGACTGGAGTGTCA 
TCCT. GAL.AIRE deletion mutants were generated 
with PCR mutagenesis deletion with QuikChange II XL 
Site-Directed Mutagenesis Kit (Stratagene) using primers: 

425F TTGACTGTATCGCCGGAATTCCAGCAGAA 

CCTGGCTCCTGGT and 
425R ACCAGGAGCCAGGTTCTGCTGGAATTCCG 

GCGATACAGTCAA; 
488F ACTGTATCGCCGGAATTCCTGGCCCCCAGC 

CCCGCC and 
488R GGCGGGGCTGGGGGCCAGGAATTCCGGC 

GATACAGT. 

Luciferase assay 

1C6 mTECs were plated onto 6-well plates and transfected 
using Fugene 6 (Roche) to co-express empty plasmid 
vector, Flag-AIRE or Flag-AIREfsx, and hINSLuc and 
pRL-CMV (Promega). Cells were harvested after 24 h 
and luciferase assays were performed using the Dual 
Luciferase kit (Promega) according to the manufacturer's 
instructions. Firefly luciferase activity of each sample was 
normalized to renilla luciferase readings. Averages were 
calculated from at least three separate transfections. 
Fold activation represents the ratio between the effector 
activated transcription and the activity of the reporter 
with empty vector. Error bars represent SEM. 

CAT assay 

1C6 cells were transfected with Fugene 6 (Roche) to 
co-express G5CAT and GAL-chimeras. CAT assays 
were performed as in Oven et al. (10) and are described 
in detail in Supplementary Methods. 

RNA isolation and RT-PCR 

Cells were harvested in Trizol (Invitrogen) and RNA was 
isolated according to manufacturer's specifications. RNA 
was treated with DNAfree (Ambion) and reverse 
transcribed with M-MLV RT (Invitrogen) with random 
or specific primers. Detailed protocol and primers are 
described in Supplementary Methods. 

Chromatin immunoprecipitation 

Chromatin immunoprecipitation (ChIP) protocol was 
adapted from Carey et al. (33), and is described in detail 
in Supplementary Methods. 

Co-immunoprecipitation 

293T cells were transfected with plasmids encoding HA- 
CDK9, Myc-Cyc Tl and GAL, WT GAL.AIRE or 
mutant GAL.AIREfsx fusion proteins using calcium 
phosphate. Cells were harvested and resuspended in hypo- 
tonic buffer (10 mM HEPES pH 7.5, 1.5 mM MgCl2, 
10 mM KCl, 5mM EDTA, 0.05% NP-40, 1 mM DTT) 



7910 Nucleic Acids Research, 2011, Vol. 39, No. 18 



supplemented with protease inhibitor cocktail (Sigma- 
Aldrich) and incubated on ice for 15min. Cell lysates 
were centrifuged for lOmin at 1000^ (4°C) and super- 
natant was removed as completely as possible and dis- 
carded. Nuclei were resuspended in sonication buffer 
(20 mM HEPES pH 7.5, 150mM NaCl, 1.5 mM MgC12, 
0.2 mM EDTA, 0.1% NP-40, 1 mM DTT) supplemented 
with protease inhibitor cocktail and gently sonicated on a 
Bandelin sonoplus sonifier (2 x 10 s pulse at power setting 
20%, on ice at all times). Nuclear lysates were centrifuged 
at maximum speed for 20 min to remove insoluble material 
and supernatants were used for immunoprecipitation. 
Two micrograms of antibodies were added and the samples 
were incubated 4h at 4°C, then protein G Dynabeads 
(Invitrogen) were added and samples were incubated 
1 additional hour. Precipitates were washed at room 
temperature four times with cold PBS supplemented 
with 0.05% NP-40 and protease inhibitor cocktail. 
Precipitated proteins were eluted by addition of Ix 
SDS-PAGE loading buffer and heating to 95°C for 
5 min. Precipitates were separated by SDS-PAGE and 
proteins were detected with specific antibodies by 
western blotting. 

Antibodies and chemicals 

Antibodies used for western blotting, co-immunoprecipi- 
tation (co-IP), ChIP or immunofluorescence: anti-AIRE 
(sc-33188 Santa Cruz Biotechnology), anti-Flag (M2 
Sigma-Aldrich), anti-CDK9 (sc-484 Santa Cruz 
Biotechnology, ab 10874 Abeam), anti-RNAPII (ab5408 
Abeam, MMS-126R Covance), anti-S2P (ab5095 
Abeam), anti-GAL (sc-510 Santa Cruz Biotechnology), 
anti-GAPDH (AM4300 Ambion), anti-oc Tubulin 
(DM1 A Sigma-Aldrich), anti-U5-116 kD (A300-957A 
Bethyl labs) rabbit and mouse control IgG (Santa Cruz 
Biotechnology). Flavopiridol was obtained from 
Sigma-Aldrich. 

RESULTS 

The frame-shift mutant AIRE protein cannot activate 
transcription 

To reveal how AIRE induces the expression of TRAs, we 
focused on an APECED patient mutation in AIRE, 
which encodes the frame-shift mutant AIRE 505fsx520 
(AIREfsx) protein with a truncated C-terminus, whose 
other domains important for homodimerization and re- 
cruitment to chromatin remain intact. The mutation was 
identified in two siblings with APECED by Ishii et al. (34). 
It represents a deletion of a guanosine at position 1640, 
which introduces a reading frame-shift at residue 505 and 
causes premature termination at residue 520 in the mutant 
AIREfsx protein. A schematic representation of AIRE 
protein domains and changes caused by the reading 
frame-shift mutation are depicted in Figure lA and B. To 
characterize functionally the mutant AIREfsx protein, we 
used a co-transfection assay with plasmid effectors and a 
luciferase reporter target that was under the control of 
the human insulin promoter (hINSLuc) (Figure IC). 



When the WT AIRE protein was co-expressed transiently, 
the luciferase activity of hINSLuc was 35-fold over that 
of the empty plasmid vector (Figure IC, compare lanes 2 
and 1), whereas the mutant AIREfsx protein was inactive 
(Figure IC, compare lanes 3 and 1). 

We also examined the expression of Ins2, an endogen- 
ous TRA gene (Figure ID). Levels of mouse Ins2 tran- 
scripts from 1C6 mTECs transiently expressing WT AIRE 
or mutant AIREfsx proteins were quantified. While the 
WT AIRE protein induced Ins2 mRNA expression 
6-fold (Figure ID, compare lanes 2 and 1), the mutant 
AIREfsx protein had no effect (Figure ID, compare lanes 
3 and 1). Similar levels of WT AIRE and mutant AIRE 
proteins were expressed in these cells (Figure IC, D). 

AIRE is localized to the nucleus and has a typical 
punctate staining in AIRE-expressing mTECs in the thy- 
mus (35,36) as well as in transfected cells of various types 
(9). We also localized WT AIRE and mutant AIREfsx 
proteins in HeLa cells (Supplementary Figure SI). All 
examined AIRE proteins were found in the nucleus. 
However, while the WT AIRE protein showed a discrete 
punctuate staining, the mutant AIREfsx protein staining 
revealed a more dispersed pattern. Similar dispersed nu- 
clear staining was also observed for a C-terminally deleted 
mutant AIRE (1-509) protein, confirming that it was 
indeed due to the loss of a functional C-terminus rather 
than mediated by the new C-terminal residues encoded by 
the frame-shifted reading frame. We conclude that the 
C-terminus of AIRE is required for the expression of 
TRAs and for the punctate nuclear distribution character- 
istic of the WT AIRE protein. 

The C-terminus of AIRE is an autonomous TAD 

To examine if the C-terminus of AIRE can act as an in- 
dependent TAD, we fused WT and different N-terminally 
truncated versions of AIRE as well as those bearing the 
505fsx520 mutation to the GAL4 DNA-binding domain 
(GAL) (Figure 2, left panel). These fusion proteins were 
expressed transiently in 1C6 mTECs with G5CAT, con- 
sisting of a CAT reporter gene with five GAL4 upstream 
activating sequences (UASs) in its promoter. WT 
GAL.AIRE fusion protein activated G5CAT -20-fold 
over GAL (Figure 2, middle panel, compare lanes 1 
and 2). A similar activation was observed for the GAL. 
AIRE425 fusion protein (Figure 2, middle panel, lane 3), 
which included only the PHD2 and the C-terminus of 
AIRE. Importantly, the GAL.AIRE488 chimera, 
including only the C-terminus, also activated G5CAT 
12-fold (Figure 2, middle panel, lane 4). In contrast, 
none of the mutant AIREfsx proteins (Figure 2, middle 
panel, lanes 5-7) activated this plasmid target. The expres- 
sion and size of mutant fusion proteins was confirmed in 
cells (Figure 2, right panel). Of note, transcriptionaly 
inactive mutant proteins were more stable than their WT 
counterparts (Figure 2, right panel, compare lanes 2-4 and 
5-7). Tethering AIRE to GAL enabled us to examine 
transcription activation independent of recruitment, 
thereby specifically mapping the core TAD to the 57 
residues in the C-terminus of AIRE. The increased stabil- 
ity of our mutant proteins agrees with a previous 



Nucleic Acids Research, 2011, Vol. 39, No. 18 7911 



1 NLS 
AIRE PhSR 



425 488 545 




AIREfsx I HSR 

B 488 545 

AIRE LAPSPARLAPGPAKDDTASHEPALHRDDLESLLSEHTFDGILQWAIQSMARPAAPFPS 
AIREfsx LAPSPARLAPGPAKDDTPX^5^PI,CrGMrn/SPF 



505 



520 



Luciferase activity 



RNA (RT-qPCR) 



c 
o 

> 
o 

CO 

o 




> 
o 

o 




C AIRE AIREfsx 



WB 



1 



AIRE 
"AIREfsx" 



AIRE AIREfsx 



1 



Figure 1. The mutant AIREfsx protein does not activate transcription. (A) Schematic representation of WT AIRE and mutant AIREfsx proteins. 
From the N-terminus, domains are: homogenously staining region (HSR); nuclear locahzation signal (NLS), SplOO, Aire-1, NucP41/75 and DEAF-1 
domain (SAND); plant homeodomains 1 and 2 (PHD); prohne rich region (PRR); WT (C) and mutant alternative (fsx) C-termini. On top, numbers 
refer to borders of PHD2 and the full length Aire protein. Below numbers refer to the first frame-shifted residue and the end of the termination 
sequence (505fsx520). (B) Residues in the C-terminus of WT and mutant AIREfsx proteins. Numbers are as in (A). Changed residues are presented 
with itahc letters. (C) Mutant AIREfsx protein does not activate transcription. Relative luciferase activity of plasmid target (hINSLuc) for WT AIRE 
and mutant AIREfsx proteins is presented as fold activation above the empty plasmid vector (C). Values represent the average of three independent 
transfections with standard errors of the mean (SEM) represented by error bars. Expression levels for AIRE proteins are presented below the bar 
graph (WB). (D) Mutant AIREfsx protein does not induce expression of the mouse Ins2 gene. mlns2 RNA levels from WT AIRE and mutant 
AIREfsx protein expressing 1C6 mTECs are presented as fold activation above the empty plasmid vector (C). Values represent the average of three 
independent transfections with SEM represented by error bars. Levels of Ins2 RNA were normalized to P-actin. Expression levels for AIRE proteins 
are presented below the bar graph (WB). 
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Figure 2. The C-terminus of AIRE is an autonomous TAD. In the left panel is a schematic representation of GAL fusion proteins and their names. 
The CAT activity of the plasmid target (G5CAT) for the WT and mutant GAL. AIRE fusion proteins in 1C6 mTECs is presented as fold activation 
above that for GAL alone. Values represent the average of three independent transfections with SEM represented by error bars. Panel on the right 
presents expression levels of GAL and GAL.AIRE chimeras (WB). 
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observations that active transcription factors have shorter 
half-hves in cells (37). We conclude that the 57 residues of 
AIRE C-terminus represent a necessary and sufficient 
TAD of AIRE. 

The C-terminus of AIRE interacts with P-TEFb 

Given the functional importance of the C-terminus of 
AIRE, we wanted to determine if it mediates its inter- 
actions with P-TEFb (10). Therefore, we performed co- 
immunoprecipitations (co-IPs) from HeLa cells transiently 
expressing epitope-tagged CDK9 protein together with 
GAL, WT GAL.AIRE or mutant GAL.AIREfsx fusion 
proteins (Figure 3A). AIRE does not require a cell-specific 
co-factor (13,18). Since other cell lines beside mTECs were 
used in most pubHcations, we used HeLa cells for our 
binding studies. As expected, CDK9 was co-precipitated 
with the WT GAL.AIRE fusion protein (Figure 3A, 
lane 2) but not with GAL (Figure 3 A, lane 1). In 
contrast, no such interaction was observed with the 
mutant GAL.AIREfsx fusion protein (Figure 3 A, lane 3). 

To determine if the loss of P-TEFb binding abolished 
target gene recruitment of P-TEFb by the mutant 
AIREfsx protein, we performed ChlPs with anti-CDK9 
antibodies. Recruitment of AIRE to its target genes 
depends on the binding between PHDl and H3K4me^ 
(13,14) and as yet undetermined other interactions. 
Thus, to isolate our study from issues related to AIRE 
recruitment to DNA, we also chose the GAL-tethering 
approach for these experiments. We used 
G5HIV2SVEDA, containing from the 5' direction 5 
UASs, the TATA box from the HIV2 promoter, and a 
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Figure 3. Mutant AIREfsx protein does not interact with P-TEFb. 
(A) Mutant AIREfsx protein does not interact with CDK9. GAL, 
GAL.AIRE and GAL.AIREfsx chimeras were expressed in HeLa 
cells. Proteins were immunoprecipitated with anti-GAL antibodies 
(aGAL) and the co-immunoprecipitation with CDK9 was detected 
with anti-CDK9 antibodies by western blotting (Top panel, IP:WB). 
The lower three panels contain 5% of input proteins for immunopre- 
cipitations (WB). (B) Mutant AIREfsx protein cannot recruit CDK9 to 
DNA. Sonicated lysates from HeLa cells co-expressing GAL.AIRE or 
GAL.AIREfsx fusion proteins and G5HIV2SVEDA were immunopre- 
cipitated with anti-CDK9 (aCDK9) antibodies for ChlP. Enrichment 
of amplicons is presented as fold enrichment over IgG control (CDK9/ 
IgG). Black and striped bars represent values for the WT GAL.AIRE 
and mutant GAL.AIREfsx chimeras with SEM depicted by error bars. 
G5HIV2SVEDA and ChIP amphcons are depicted above and below 
the bar graph, respectively (promoter, Pr; 5'-end of gene, 5'; 3'-end of 
gene, 3', polyadenylation signal, pA; untranscribed region, U). 
G5HIV2SVEDA contains five GAL binding sites (UAS) and the 
TATA box (T) upstream of the a-globin gene (agl) with the inserted 
fragment from the fibronectin 1 gene (FNl). 



1.5 kb fragment with the human a-globin gene, into 
whose third exon a 3 kb fragment from the human fibro- 
nectin 1 gene was inserted (Figure 3B) (28). This plasmid 
target has a long coding region (4 kb), which was optimal 
for our ChlP-qPCR analyses (promoter (Pr), 5^- and 
3^-ends of the gene and polyadenylation site (pA), and a 
control untranscribed region (U)). Furthermore, this plas- 
mid target had low basal levels of expression but could be 
induced potently by the co-expression with a GAL-fused 
TAD (28) such as the WT GAL.AIRE fusion protein 
(Supplementary Figure S2). The target gene analyzed in 
our ChlPs is depicted on top of Figure 3B. Below it are 
presented our amphcons for ChlP. Due to its artificial 
nature, primers designed to ampHfy different regions of 
G5HIV2SVEDA were specific for the plasmid target and 
did not ampHfy endogenous cellular genes. Indeed, we 
observed 1.5- to 2.5-fold enrichment of P-TEFb on this 
gene (Figure 3B, lower panel, black bars, Pr, 5^ 3^ and pA 
amphcons) above background levels (Figure 3B, lower 
panel, black bar, U amplicon) in cells expressing the WT 
GAL.AIRE fusion protein. In contrast, even though tran- 
siently expressed proteins were expressed at comparable 
levels (Supplementary Figure S3) the mutant GAL. 
AIREfsx fusion protein failed to recruit P-TEFb to 
G5HIV2SVEDA (Figure 3B, compare striped and black 
bars), thus estabhshing the functional role for interactions 
between P-TEFb and the C-terminus of AIRE in this pro- 
cess. Based on these results, we conclude that the AIRE 
TAD recruits P-TEFb to target genes and that the mutant 
AIREfsx protein has lost this property. 

AIRE increases the phosphorylation of the CTD 

Since our assays defined the AIRE TAD and its inter- 
actions with P-TEFb, we wanted to analyze the phosphor- 
ylation of CTD on AIRE-induced genes. Since CDK9 
phosphorylates S2P, we focused on this modification. 
G5HIV2SVEDA was co-expressed transiently with the 
WT GAL.AIRE or mutant GAL.AIREfsx fusion proteins 
in HeLa cells. Targeting of GAL fusion proteins to 
G5HIV2SVEDA was verified by ChlP with anti-GAL 
antibodies (Figure 4B). Indeed, the WT GAL.AIRE and 
mutant GAL.AIREfsx fusion proteins were recruited to 
DNA at comparable levels (Figure 4B, black and striped 
bars). Moreover, we observed a high occupancy of 
our GAL fusion proteins at the promoter (Pr) where the 
UASs are located (Figure 4B, black and striped bars, Pr). 
The 5^ amplicon was also enriched in the GAL ChlP, 
although these levels were approximately 20-fold lower 
than those for the Pr amplicon (Figure 4B, black 
and striped bars, compare Pr and 5^ amphcon). We 
conclude that our assay had a high resolution and the 
fusion proteins were recruited to DNA at comparable 
levels. 

Whereas we found ~2-fold more RNAPII on the gene 
and at polyadenylation signal (Figure 4C, compare black 
and striped bars, 5^ 3^ and pA amphcons) in WT 
GAL.AIRE fusion protein-expressing cehs compared to 
levels in mutant GAL.AIREfsx fusion protein-expressing 
cehs or to levels in the untranscribed region (Figure 4C, 
compare black, U amplicon), its levels in the promoter 
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Figure 4. AIRE increases S2P phosphorylation. (A) Schematic representation of the G5HIV2SVEDA minigene and ChIP amphcons is as in 
Figure 3B. (B-D) ChIP of G5HIV2SVEDA. Transfected ceUs were analyzed with specific antibodies as in Figure 3B. (B) ChIP with anti-GAL 
(aGAL) antibodies. (C) ChIP with anti-RNAPII (aRNAPII) antibodies. (D) ChIP with anti-S2P (aS2P) antibodies. (E) AIRE increases relative 
levels of S2P. To evaluate relative levels, levels of S2P were normalized to those of RNAPII (S2P/RNAPII) for each amphcon. Error bars 
represent SEM. 



region (Pr) did not differ significantly between the WT 
GAL.AIRE and mutant GAL.AIREfsx chimeras 
(Figure 4C, compare black and striped bars, Pr). Since 
the occupancy of RNAPII at the promoter in cells express- 
ing the mutant GAL.AIREfsx fusion protein was above 
that at the untrancribed amplicon (U), this finding indi- 
cates that RNAPII is recruited to the promoter independ- 
ently of its interaction with P-TEFb. 

In contrast, levels of S2P were markedly higher on the 
gene in the presence of the WT GAL.AIRE fusion protein 
compared to the mutant GAL.AIREfsx fusion protein 
(Figure 4D, compare black and striped bars, 3^ and pA 
amphcons). Indeed, levels were ~2-fold above those from 
mutant GAL.AIREfsx fusion protein-expressing cells in 
the promotor (Pr) and 5^-end of the gene, 3-fold in the 
3^-end of the gene and 8-fold at the polyadenylation site 
(pA) (Figure 4D, compare black and striped bars). To 
determine relative levels of S2P, we divided amounts of 
S2P by those of RNAPII for each amplicon. When they 
were examined in WT GAL.AIRE fusion protein- 
expressing cells and compared to the untranscribed region 
(U), 1.5-, 2- and 3-fold enrichment was observed at the 
promoter (Pr), at the 3^-end of the gene (3^ and at the 
polyadenylation site (pA), respectively (Figure 4E, black 
bars). In contrast, in mutant GAL.AIREfsx fusion 
protein-expressing cells, relative S2P levels throughout 
the gene were equivalent to those of the untranscribed 
region (U) (Figure 4E, striped bars). Thus, AIRE- 
induced activation of transcription is associated with 
elevated levels of RNAPII and S2P in the body and 
3^-end of the gene, which depend on the AIRE TAD. 



AIRE increases pre-mRNA splicing 

Since pre-mRNA splicing is a co-transcriptional process 
influenced by CTD phosphorylation, we wanted to 
examine its regulation by AIRE. We used G5HIV2dsx, a 
plasmid splicing reporter target containing a minigene of 
two exons with a weak splice site from the Drosophila dsx 
gene with or without an exonic spHcing enhancer (ESE) 
(Figure 5 A, Supplementary Figure S4A). ESEs are re- 
cognized by SR proteins, which interact with the spHcing 
machinery to stimulate the inclusion of weak exons (38). 
Indeed, without a strong activator recruited to its 5^ UASs 
and CTD phosphorylation, G5HIV2dsx is transcribed 
at low levels and its transcripts are processed ineffi- 
ciently (29,39). However, because it has a very short coding 
sequence (84 and 70 nucleotide exon fragments with a 114 
nucleotide intron), we used the longer G5HIV2SVEDA 
for our ChlPs. As presented in Supplementary Figure S2, 
the WT GAL.AIRE fusion protein also activated 
G5HIV2dsx. To visualize AIRE-induced effects on co- 
transcriptional spHcing of this minigene, we used a 
primer pair that amplified spliced and unspliced tran- 
scripts for RT-PCR. When the WT GAL.AIRE fusion 
protein was co-expressed with G5HIV2dsx, we detected 
more spliced (lower band) than unspliced transcripts 
(upper band) compared to GAL (Figure 5B, compare 
upper and lower bands in lanes 2 and 1). Interestingly, a 
similar trend was observed when we co-expressed just the 
AIRE TAD fused to GAL (GAL.AIRE488) (Figure 5B, 
compare lanes 2 and 4). In contrast, the mutant 
GAL.AIREfsx fusion protein did not have this effect 
(Figure 5B, compare lanes 1-3). Using the plasmid 
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Figure 5. AIRE increases pre-mRNA splicing. (A) Schematic representation of the G5HIV2dsx minigene with the RT-PCR primer pair. G5HIV2dsx 
contains 5 GAL-binding sites (UAS) upstream of the Drosophila doublesex (dsx) exons 3 and 4 and the intervening sequence (intron) with an exonic 
spHcing enhancer (ESE) containing 6 GAA repeats and the SV40 poladenylation signal (pA). (B) AIRE affects sphcing of dsx pre-mRNA. HeLa cells 
co-expressed GAL or GAL fusion proteins and G5HIV2dsx. Total mRNA was isolated and sphcing was assayed by RT-PCR with a primer 
pair that amplified spHced (lower band) and unsphced (upper band) transcripts (RT-PCR). (C) Expression of fusion proteins was monitored 
with anti-GAL antibodies by western blotting (WB). GAL and GAL.AIRE fusion proteins are identified by arrows. (D) U5 snRNP is enriched 
on G5HIV2SVEDA in AIRE expressing cells. Sonicated lysates from HeLa cells co-expressing GAL or GAL.AIRE fusion protein and 
G5HIV2SVEDA were immunoprecipitated with anti-U5 (aU5) antibodies for ChlP. Increased levels of specific ampHcons are presented as fold 
enrichment over the IgG control (U5/IgG). Black and white bars represent values for the GAL.AIRE chimeras and GAL, respectively, with 
SEM depicted by error bars. G5HIV2SVEDA with amphcons is diagrammed above the bar graph (promoter, Pr; 5'-end of gene, 5'; 3'-end of 
gene, 3'; polyadenylation signal, pA). It contains five GAL binding sites (UAS) upstream of oc-globin gene (agl) with inserted fragment of fibronectin 
1 gene (FNl). 



target lacking the ESE, we observed faint bands of spliced 
transcripts when the WT GAL.AIRE or GAL.AIRE488 
fusion proteins were co-expressed, but these levels were 
much lower than those with the ESE-containing reporter 
(Supplementary Figure S4B, lanes 1-4). All GAL fusion 
proteins were expressed equivalently (Figure 5C and 
Supplementary Figure S4C). We conclude that AIRE 
increases the pre-mRNA spHcing. 

Prompted by these results and previous findings on 
co-transcriptional accumulation of cellular splicing factors 
(40), we examined if AIRE-induced increase of S2P 
stimulated the recruitment of spliceosome component 
U5 to our plasmid target. To this end, we examined the 
abundance of spliceosome subunits with anti-U5-116 kD 
antibodies by ChIP (Figure 5D, compare black and white 
bars). U5-116 kD (U5) is a subunit of U5 snRNP, which 
is a shared component of both the major U2 and minor 
U12 spliceosomes (41). Indeed, we observed an enrich- 
ment of U5 on the 3^-end of the gene in cells expressing 
the WT GAL.AIRE fusion protein. Thus, AIRE increases 
the recruitment of co-transcriptional processing factors 
to DNA that accumulate with RNAPII at the 3'-end of 
the gene. 

Inhibition of CDK9 abrogates effect of AIRE on 
pre-mRNA splicing 

To examine if AIRE-induced increase of pre-mRNA 
splicing and increased S2P levels were linked, we made 
use of a pharmacological inhibitor of P-TEFb. 



Flavopiridol (FP) binds tightly to the ATP pocket of 
CDK9 and thereby inhibits its kinase activity (42). To 
this end, we co-expressed the WT GAL.AIRE fusion 
protein or GAL with G5HIV2dsx and compared spHcing 
patterns between FP-treated and untreated cells. The 
reported half maximal inhibitory concentration of FP 
for the kinase activity CDK9 is 22 nM (43). Thus, we 
used a low concentration to minimize effects of FP on 
other CDKs. As presented in Figure 6 A (compare lanes 
2 and 1), when treating cells with FP at 25 nM, we ob- 
served a clear reduction in S2P levels in cells. We observed 
no decrease in cell proHferation after 24 h of treatment. 
The expression of the WT GAL.AIRE fusion protein 
or GAL in untreated as well as FP-treated cells was 
confirmed with specific antibodies by western blotting 
(Figure 6B). From these cells, we isolated RNA and 
used specific primers for spHced, unspliced and total dsx 
RNA (Figure 6C, upper panel) and RT-qPCR to measure 
activation of splicing by AIRE. SpHcing efficiencies were 
calculated as ratios of normahzed levels of spHced versus 
unspliced transcripts. In agreement with our earHer 
results, when we compared splicing efficiency in untreated 
ceHs, we detected a nearly 3.5-fold increased pre-mRNA 
spHcing with the WT GAL.AIRE fusion protein compared 
to GAL (Figure 6C, compare black and white bars). By 
contrast, in FP-treated cells, no such AIRE-induced 
increase was observed (Figure 6C, compare black and 
white bars). Thus, by inhibiting the kinase activity 
of CDK9, AIRE-induced enhancement of pre-mRNA 
spHcing is blocked. 
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Figure 6. Flavopiridol inhibits effects of AIRE on splicing. 
(A) Flavopiridol decreases total levels of S2P. Lysates from Hela 
cells, which were treated with 25 nM flavopiridol (FP) or DMSO only 
(C) for 24 h, were analyzed with specific antibodies by western blotting 
(WB). The upper panel presents levels of S2P. Lower panel presents 
levels of GAPDH. (B) GAL fusion proteins and G5HIV2dsx are ex- 
pressed in cells treated with FP. HeLa cells co-expressing G5HIV2dsx 
and GAL or the WT GAL.AIRE chimera were treated with 25 nM FP 
or DMSO only for 24 h as indicated. Expression of proteins was 
determined with anti-GAL antibodies by western blotting (WB). 
GAL and GAL.AIRE fusion proteins are indicated by arrows. 
(C) FP blocks AIRE-induced enhancement of pre-mRNA sphcing. 
Upper panel presents schematic representation of G5HIV2dsx 
minigene with probes used for RT-qPCR that specifically amplified 
spliced, unsphced or total dsx transcripts. Relative sphcing efficiency 
was determined and is presented for the WT GAL.AIRE chimera 
(black bars) as fold activation relative to GAL (white bars) for 
treated and untreated cells (FP, DMSO). Error bars represent SEM. 



cntrl siRNA: -i- 
CDK9siRNA: -h -H -H 
t(h): 24 24 48 84 
CDK9 I- I 



GAPDH I— ^— "] 

12 3 4 



B 



cntrl siRNA: -i- 
CDK9 SiRNA: 

GAL + 
GAL.AIRE 

galaireI 

GALt-T 



G5HIV2dsx 
UAS (GAA)6 

- ► "4 spliced 
^ ^unspliced 
^ ^ total 



Relative splicing levels 



+ + 
-I- 

-I- 



CDK9 
oc-Tubulin 



12 3 4 




cntrl 
SiRNA 



Figure 7. AIRE requires CDK9 for its effects on pre-mRNA sphcing. 

(A) Time course of CDK9 siRNA knockdown. HeLa cells were trans- 
fected with CDK9 or control siRNAs (cntrl). Subsequently, cells were 
harvested at various time-points and protein levels were determined 
with specific antibodies by western blotting. In the upper and lower 
panels, levels of CDK9 and GAPDH are presented, respectively. 

(B) Protein effectors are expressed in CDK9-knockdown cells. HeLa 
cells were co-transfected with CDK9 or control siRNAs. GAL or the 
WT GAL.AIRE chimera and G5HIV2dsx were co-expressed as 
indicated. Protein levels of expressed proteins are presented in the 
upper two panels with those of CDK9 and tubuHn in the lowest 
panels, respectively. (C) CDK9 siRNA knock-down inhibits effects of 
AIRE on splicing. Upper panel contains a schematic representation of 
the G5HIV2dsx minigene with amphcons used to amphfy spHced, 
unsphced or total dsx transcripts by RT-qPCR. Relative sphcing effi- 
ciency was determined and is presented for the WT GAL.AIRE 
chimera (black bars) as fold activation relative to GAL (white bars) 
for cells transfected with CDK9 or control siRNAs. Error bars repre- 
sent SEM. 



AIRE requires CDK9 to increase pre-mRNA splicing 

Since FP may inhibit other CDKs, to vaHdate further the 
role of CDK9 in AIRE-induced enhancement of sphcing, 
we also knocked-down CDK9 with siRNA and examined 
AIRE-induced splicing. As presented in Figure 7A, CDK9 
levels were decreased after 24 h and were depleted at 48 h. 
The corresponding siRNA and G5HIV2dsx were co- 
expressed with GAL or WT GAL.AIRE fusion proteins. 
The expression of protein effectors and CDK9 knock- 
downs were vahdated with specific antibodies by western 
blotting (Figure 7B). From these cells, RNA was isolated 
and levels of spliced and unsphced mRNAs were quanti- 
fied as in Figure 6C. When comparing splicing efficiencies 
in cells transfected with control siRNA, we also ob- 
served 4.5-fold increased pre-mRNA sphcing by AIRE 
(Figure 7C, compare black and white bars) which was 
comparable to effects in untreated cells (Figure 6C). 
Underlining the specific requirement for P-TEFb, AIRE 
had no effect when expressed in cehs depleted of CDK9 
(Figure 7C, compare black and white bars). Thus, AIRE 
requires P-TEFb to enhance pre-mRNA sphcing. 

By recruiting P-TEFb, AIRE induces splicing of an 
endogenous TRA gene 

To vahdate our model, we examined the endogenous 
KRT14 gene, which is regulated by AIRE (18). Total 
RNA was isolated from 293T cehs expressing transiently 
the empty plasmid vector, WT AIRE or mutant AIREfsx 
proteins (Figure 8B). Spliced KRT14 and KRT14 intron 



containing transcripts were amplified with specific primers 
by RT-qPCR (Figure 8A) and quantified relative to those 
from the endogenous GAPDH gene. As presented in 
Supplementary Figure S5A, the expression of KRT14 
was induced by the WT AIRE but not mutant AIREfsx 
proteins. Sphcing efficiencies were calculated as ratios of 
normalized levels of spliced versus unsphced transcripts 
and are presented as fold activation above the sphcing 
observed in the presence of only the empty plasmid 
vector, which was set to 1. WT AIRE protein increased 
the splicing efficiency 9-fold above the control (Figure 8C, 
compare black to white bars). As expected, mutant 
AIREfsx protein did not affect the splicing efficiency of 
the KRT14 gene (Figure 8C, compare striped to white 
bars). Similar results were obtained for the S100A8 gene, 
which is also induced by AIRE (data not shown). We 
conclude that the mutant AIREfsx protein affects 
neither the transcription nor the splicing of endogenous 
TRA genes, which is in agreement with our observations 
with plasmid targets. 

To examine further the role of P-TEFb in AIRE- 
induced expression of TRA genes, we compared levels 
of KRT14 mRNA from cells expressing the WT AIRE 
protein which were treated or not with FP (Figure 8D). 
Indeed, effects of AIRE were decreased by 85% in cells 
treated with FP compared to untreated cells 
(Supplementary Figure S5B, compare black bars). FP 
also inhibited AIRE-induced splicing of KRT14 
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Figure 8. By recruiting P-TEFb, AIRE induces splicing of an endogenous TRA gene. (A) Schematic representation of the KRT14 gene with primer 
pairs used to ampHfy sphced and unsphced transcripts by RT-qPCR. (B) WT AIRE and mutant AIREfsx proteins are expressed in 293T ceUs. 
Expression of proteins was determined with anti-AIRE and anti-a-tubuHn antibodies by western blotting (WB). (C) Only the WT AIRE protein 
induces spHcing of the KRT14 pre-mRNA. Relative splicing efficiency was determined and is presented for WT AIRE (black bar) and mutant 
AIREfsx (hashed bar) proteins as fold activation relative to the empty plasmid vector (C, white bar). Error bars represent SEM. (D) WT AIRE 
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AIREfsx proteins. Enrichment of specific KRT14 amphcons is presented as fold enrichment over IgG control (CDK9/IgG). White, black and striped 
bars represent values for the empty plasmid vector (C), WT AIRE and mutant AIREfsx proteins, respectively, with SEM depicted by error bars. The 
KRT14 gene and ChIP amplicons are depicted above the bar graph (promoter, Pr; 5' of coding region, 5'; 3' of coding region, 3'; polyadenylation 
signal, pA). (G) ChIP with anti-S2P (aS2P) antibodies. Expression levels for AIRE and a-tubuhn proteins are presented below the bar graph (WB). 
Lysates from transfected cells were analyzed with specific antibodies by western blotting. 



transcripts (Figure 8E, FP, compare black to white bars). 
In these cells, AIRE was expressed at comparable levels 
(Figure 8D). Thus, AIRE requires P-TEFb to induce tran- 
scription and pre-mRNA spHcing of endogenous TRA 
genes. 

These results and a previous report of the recruitment of 
P-TEFb by AIRE to endogenous genes (10) led us to 
examine if the mutant AIREfsx protein fails to do the 
same. In 293T cells, we expressed transiently the empty 
plasmid vector, WT AIRE or mutant AIREfsx proteins. 
Subsequently, we performed ChlPs with anti-CDK9 
antibodies and analyzed KRT14 DNA enrichment by 
qPCR. qPCR amplicons are presented in the schematic 
of the KRT14 gene above the bar graph (Figure 8F). 
Indeed, we observed a significant enrichment of CDK9 
at the promoter (4.5-fold), in the coding region (26- and 
14-fold) and at the polyadenylation site (6-fold) in WT 
AIRE protein-expressing cells compared to control cells 
(Figure 8F, compare black to white bars). Importantly, no 
significant enrichment was observed in mutant AIREfsx 
protein-expressing cells (Figure 8F, striped bars). With 
anti-S2P antibodies, we examined levels of S2P on the 
KRT14 gene. They were also enriched between 2- and 



4-fold in WT AIRE protein-expressing cells compared to 
control and mutant AIREfsx protein-expressing cells 
(Figure 8G, compare black to white and striped bars). 
We conclude that by recruiting P-TEFb to endogenous 
TRA genes, AIRE also increases their levels of S2P. 

DISCUSSION 

By characterizing the functional defect caused by an 
APECED patient mutation, resulting in a frame-shift at 
position 505 in AIRE (AIREfsx), we elucidated the mech- 
anisms of AIRE-induced activation of otherwise tissue- 
restricted genes. This mutation affects only the 
C-terminus of AIRE, which is highly conserved between 
mouse and human AIRE proteins. The mutant AIREfsx 
protein could not activate exogenous plasmid targets or 
representative endogenous TRA genes. This situation per- 
sisted even when AIRE proteins were targeted artificially 
to DNA via the GAL4 DNA-binding domain. This heter- 
ologous tethering also allowed us to map the minimal 
AIRE TAD, which was found to mediate interactions 
with P-TEFb. Next, the mutant AIREfsx protein could 
not recruit P-TEFb to plasmid DNA via heterologous 



Nucleic Acids Research, 2011, Vol. 39, No. 18 7917 



DNA-tethering or to an endogenous TRA gene in chro- 
matin when expressed transiently in cells. It also lost the 
abihty of its WT counterpart to increase levels of RNAPII 
and CTD phosphorylation on the body and 3^-end of 
these target genes. These increased levels of S2P could 
be correlated with those of specific transcripts and 
improved co-transcriptional pre-mRNA processing of 
minigene plasmid targets and an endogenous TRA gene 
in chromatin. Of note, when the kinase activity of CDK9 
was blocked by flavopiridol or its levels diminished by 
CDK9 siRNA, the WT AIRE protein also lost the 
ability to increase the pre-mRNA splicing of its target 
genes. In conclusion, we mapped the AIRE TAD, 
defined its interaction with P-TEFb, as well as 
characterized functional consequences of this binding on 
transcription of target genes and co-transcriptional pro- 
cessing of their nascent transcripts. 

This study confirms and extends a previous observation 
on interactions between AIRE and P-TEFb (10). 
Importantly, we observed effects of increased levels of 
S2P on co-transcriptional processing of nascent tran- 
scripts. Moreover, enhanced spHcing efficiencies and 
increased levels of steady state mRNAs correlated 
directly with CTD phosphorylation. Thus, our findings 
complement nicely a previous report, which reported 
that AIRE exerts most of its effects on gene expression 
at the level of pre-mRNA splicing (18). In their proteomic 
analysis, Abramson et al. identified 45 candidates that 
interacted with AIRE and were involved in nuclear trans- 
port, DNA-damage response, chromatin remodeling, 
transcription and pre-mRNA processing. Knockdowns 
of only 11 of these candidates impaired AIRE'S abihty 
to regulate the expression of target genes in chromatin 
and on episomal plasmids. Of interest, except for the 
Myb binding protein 1 A (MYBBPIA), all of them were 
found associated with P-TEFb and/or RNAPII in another 
study (44). Since chromatin remodehng factors, splicing 
and polyadenylation machineries bind to the elongating 
transcription complex (26), associations between AIRE, 
P-TEFb and RNAPII can better explain these proteomic 
data than the ability of AIRE to bind all these proteins 
directly. Thus, AIRE enters a long Hst of transcription 
factors that recruit P-TEFb to target genes (22). Of 
interest, mono-ubiquitylation increases the affinity of 
VP16 and other TADs for P-TEFb (37,45). Since AIRE 
is ubiquitylated (15), this post-translational modification 
could also increase interactions between AIRE and 
CDK9. 

How AIRE is targeted precisely to TRA genes in 
mTECs remains to be investigated. Since AIRE increases 
the expression of so many genes, it is clear that this 
binding has to be rather promiscuous. One aspect of this 
recruitment is the binding between PHDl and histone 
H3K4me^ (13,14,46). However, AIRE activated exogen- 
ous plasmid targets in this and other studies, on which 
chromatin does not assemble efficiently (47). 
Furthermore, mutations outside the PHDl, e.g. in the 
SAND domain, prevent the recruitment of AIRE to 
target genes (35). Irrespective of this remaining conun- 
drum, since P-TEFb does not initiate transcription, 
genes that are activated by AIRE will contain RNAPII 



already engaged on or near their promoters (48). The re- 
cruitment of P-TEFb will then modify these transcription 
complexes for elongation and co-transcriptional process- 
ing of target genes. Different levels of engaged but stalled 
transcription complexes on promoters could then explain 
why clones of primary mTECs isolated from the mouse 
thymus display different profiles of AIRE-induced TRAs 
(49). 

Our study also finds resonance with whole genome ana- 
lyses of P-TEFb target genes, for example those regulated 
by the proto-oncogene c-Myc (48). Similar to that situ- 
ation, we find that AIRE does not increase levels of 
RNAPII at the promoter but rather in the body and es- 
pecially at the 3^-end of susceptible genes. This finding sug- 
gests that P-TEFb phosphorylates preferentially subunits 
of negative elongation factors at the 5^-end of transcrip- 
tion units, which must be modified to allow RNAPII to 
elongate (22). Since P-TEFb remains associated with the 
transcription complex in the body and 3^-end of the gene, 
S2P levels will increase throughout elongation. Our ChlPs 
with anti-U5 antibodies revealed that spHceosome compo- 
nents were enriched at the 3^-end of the gene. This finding 
suggests that RNAPII accumulates at these 3^-ends to 
permit optimal co-transcriptional processing, i.e. pre- 
mRNA splicing and polyadenylation, which might not 
go to completion during the actual transcription of these 
sequences. In other words, transcription complexes might 
move too quickly for this maturation of primary tran- 
scripts to occur in situ (50). Consistent with this notion, 
two recent studies reported that RNAPII pauses for 
pre-mRNA splicing at the 3^-end of the gene in 
Saccharomyces cerevisiae (51,52). In summary, a single 
nucleotide mutation from an APECED patient leading 
to a mutant AIRE protein has allowed us to dissect these 
steps more precisely and to define how genes can be regu- 
lated in cells by a single protein, in this case for the specific 
purpose of directing central tolerance and prevent deleteri- 
ous immune responses against self, which can result in 
autoimmunity. 
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